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TOWARD  A  THEORY  OF  NONLINEAR  STOCHASTIC  REALIZATION 

+  ■  § 

Anders  Lindquist  ,  Sanjoy  Mitter' ,  and  Giorgio  Picci 

1.  INTRODUCTION 

The  following  is  a  central  problem  in  stochastic  systems  theory:  Given  a  station¬ 
ary  stochastic  process  {y(t);tsS},  find  a  (possibly  infinite-dimensional)  vector 
Markov  process  {xCt);t€ll},  called  the  state  process,  and  a  function  f  so  that  yCt)  = 
^CxCt))  for  ail  ten.  Moreover,  find  a  stochastic  differential  equation  driven  by  a 
Wiener  process  and  having  the  state  process  x  as  its  unique  solution.  The  problem  of 
characterizing  the  family  of  all  such  representations  is  known  as  the  stochastic  reali¬ 
zation  problem. 

There  is  by  now  a  rather  comprehensive  theory  of  stochastic  realization  for  the 
case  that  {yCt);t€M}  is  Gaussian  [1-3],  in  which  case  the  representations  can  be  taken 
to  be  linear,  i.e.  both  f  and  the  stochastic  differential  equation  are  linear.  This 
linear  theory  can  be  applied  to  non-Caussian  processes  also,  but  -then  we  need  to  give 
up  the  requirement  that  x  is  Markov  and  that  it  is  generated  by  a  Wiener  process,  re¬ 
placing  these  concepts  by  "wide  sense  Markov”  [4]  and  ’’orthogonal  increment  process" 
respectively.  If  we  are  not  willing  to  do  so,  a  nonlinear  stochastic  realization 
theory  is  needed.  That  is  the  topic  of  this  paper. 

In  this  paper  we  shall  apply  Wiener's  theory  of  homogeneous  chaos  [5,6]  to  the 
nonlinear  stochastic  realization  problem.  For  simplicity  and  ease  of  notation  we  shall 
assxime  that  the  process  y  is  scalar,  although  the  machinery  which  we  develop  is  suffi¬ 
cient  to  accommodate  also  the  vector  case.  Other  assumptions,  such  as  y  admitting  an 
innovation  representation,  are  however  crucial  to  our  approach,  (In  this  respect,  it 
might  be  more  appropriate  to  consider  a  process  y  with  stationary  increments,  and  indeed 
with  minor  modifications  we  could  have  done  so.)  In  the  extension  of  this  work  we  see 
the  possibility  of  making  contact  with  nonlinear  filtering  [7,8]  and  that  is  partially 
a  rotivation  for  this  work, 

2.  PROBLEM  FORMULATION 

Let  {y(t);tell}  be- a  non-Gaussian  stationary  stochastic  process  which  is  mean- 
square  continuous,  purely  nondeterministic,  and  centered,  and  let  Y  be  the  sigma-field 
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generated  by  y.  Then  define  H  to  be  the  Hilbert  space  of  all  centered  / -measurab 1 e 
random  variables,  having  inner  product  <C,ri>  = E{5n} .  Since  y  is  stationary  there  is  a 
strongly  continuous  group  of  unitary  operators  {U^;tell}  on  H,  called  the  shift,  such 
that  yCt+s)  =U^y(s3  for  all  t  and  s  [9].  Let  and  be  the  sigma-fields  generated 
t>y  {yCs);s<t}  and  {yCs);s2:t}  respectively. 

Next  assume  that  y  has  an  innovation  process  {v(t);t€R},  by  which  we  shall  here 
mean  a  Wiener  process  such  that  a{v(T) -v(a) ;T,a <  t}  =y^.  (Here  a{*}  denotes  the  sigma- 
field  generated  by  the  random  variables  inside  the  curly  brackets.)  Then,  by  symmetry, 
it  also  has  a  backward  innovation  process  {v(t);te31},  i.e.  another  Wiener  process  such 
that  oIvCt) -'3(a)  ;T,a  >  t}  Now,  since  =  a{v(t) ;  t  eR}  =a{v(t);t£R}  we  can  apply 

Wiener’s  homogeneous  chaos  theory  [5,6].  Let  H^  denote  the  Gaussian  space  [5]  generated 
by  {v(t);t€R}  or,  which  is  equivalent,  by  {'3(t);t€R}.  Since  y  is  mean-square  contin¬ 
uous,  Hj  is  a  separable  space,  and  therefore  has  a  countable  orthonormal  basis 

Now,  let  be  the  (closed)  linear  subspace  of  all  polynomials  in  of 

degree  not  exceeding  n.  Next  define  =  ?  ©P  i.e.  the  orthogonal  complement  of 
P  ,  in  P  .  Then  it  can  be  shown  [5,6]  that 

H  =  Hj_  ®  H2  ®  Hj  ®  ,  .  .  (1) 

where  ®  denotes  orthogonal  direct  sum.  The  space  H^  is  called  the  homoger^ous  ohaos 
of  H.  Since  y(0)  g  H,  there  is  an  orthogonal  decomposition 

y(0)  =  yj^CO)  +  y2(0)  +  y3(0)  +  ...  (2) 

where  y_(0)  eH  .  It  is  easy  to  see  that  each  chaos  H  is  invariant  under  the  shift  U. , 
n  n  n  L 

and  consequently,  for  any  t  gR,  we  have  a  decomposition  such  as  (2)  for  y(t)  in  terms 
of  y^(t)  :=U^yjj(0),  n=l,2,3... 

In  order  to  obtain  a  state  space  description  we  introduce  a  gast  space  H  and  a 

4.  •  .fi  « 

future  svaae  H  as  follows.  Let  H  (H  )  be  the  subspace  of  all  centered  /  -measurable 
(y^ -measurable)  random  variables.  Then,  defining  H^  and  H^  to  be  the  Gaussian  spaces 
generated  by  {v(t) -v(a) ;T,a s  0}  and  {v(x) -v(a) ;T,a  > 0}  respectively,  we  obtain  the 
chaos  expansions 

H"  =  Hr  ©  Hi  ©  H3  ®  ...  (3a) 

I  1  2  0 

H"^  =  hJ^  ©  HiJ  ©  H3  ®  ...  (3b) 

where  clearly  H”  c  H  and  H^cH  for  all  n.  Note  that  hJ  n  0  and  v  =  H, ,  but 
nnnn  11  111 

H~  vH‘*’j:H. 

Now,  if  y  were  a  Gaussian  process,  y  would  have  a  component  only  in  the  first 
chaos,  i.e.  y  =  yj,  and  consequently  state  spaces  for  y  could  be  constructed  along  the 
lines  of  [1,2]  by  finding  the  minimal  Markovian  (Hr,H^) -splitting  subspaces  in  H^. 

[We  recall  that,  for  two  subspaces  A. and  B,  X  is  an  {A ,B) -splitting  subspaae  if 

XX  X  * 

<E  a,E  B>=<a,g>  for  all  aeA  and  g  e  B,  where  E"  denotes  orthogonal  projection  on  the 

subspace  X,]  However,  for  a  non-Gaussian  process  y,  there  will  be  some  nontrivial  com¬ 
ponent  y^,  n>l,  and  consequently  the  state  space  construction  will  have  to  involve  at 
least  those  higher  chaoses  in  which  y  has  a  component.  To  this  end  define  the  index  set 


N  :={n  ]  Xj^CO)  0}  u{l}.  For  reasons  which  will  soon  be  evident,  we  shall  have  to  al¬ 
ways  include  the  first  chaos  in  otir  analysis.  (In.  particular,  see  Section  7.) 

Hence  we  call  X  a  state  space  for  y  if  . 

X  =  ®  X  (4) 

n^N  " 

*  . 

where  X^cH^  is  an  -splitting  subspace,  and  X  is  Markovian  in  the  sense  that, 

if  X  :  =  a{X},  X  and  X  :  =  cr{Y^>QU^X}  ,  X  and' X'*’  are  conditionally  indepen¬ 

dent  given  X;  we  shall  write  this  X'liX'*’  |  X.  We  say  that  X  is  minimal  if  there  is  no 
other  state  space  X'  for  which  X'  :=a{X'}  is  properly  contained  in  X. 

The  problem  at  hand  is  now  to  construct  all  minimal  state  spaces  for  y  and  to  ob¬ 
tain  a  dynamical  representation  (realization^  for  each  of  them. 


3.  THE  STRUCTURE  OF  H 

According  to  ItB's  Theorem  [10] 

H 


n 


{I^Cf;v)  1  f  e 


where  is  the  multiple  Wiener  integral 

.^1  r^n-1 


i,Cf;v)  - 


fCtj,t2,...,t^}dv(tj} ...dvCt^) 


(S) 


(6) 


and  £20^^^  are  the  symmetric  functions  in  L.,(]R^).  Although  the  region  of  integration 


is  such  that  (5)  remains  the  same  if  L2CH^)  is  exchanged  for  we  prefer  the 

"  '^11 

former  since  we  have  a  one-one  correspondence  between  elements  in  and  L2CR  ).  In 
fact,  we  can  establish  an  isometric  isomorphism  between  these  spaces  [5,6,10] ,  Now, 
for  i=l,2,...,n,  let  e  be  arbitrary.  Then  there  exist  unique  functions  f^€L20R) 
such  that  = /”^f^Ct)dvCt) .  Next  define 


where 


nj  »  Jlj  .  ...  *  =  n! 


f(tj_,t2,...,t^3  =  f  (tj^]f  ct2}...f  (t^) 

me  G  1  2  n  . 


C7) 

C8) 


G  being  the  symmetric  group  of  permutations  of  n  letters.  Since  finite  linear  combina¬ 


tions  of  functions  of  type  (8)  are  dense  in  L20R  ),  Ito’s  Theorem  implies  that 


Hjj  =  sp{n^  *  n2  *  •  *  %  1  ,n2.  •  •  •  e 


where  sp  denotes  closed  linear  hull.  We  shall  write  this  as 

H  =  H.  *  H.  *  . . .  *  H,  . 
nil  1 

By  Ito's  formula  [11;  p.38] 

*  n2  *  •  •  •  *  =  Cri2  *  '^3  *  •  • '  *  %)  * 


(9a) 


(9b) 


which  can  be  solved  recursively.  For  example. 


(10) 
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n  1  *  n  2  =  n  in  2  ■  1  2> 

ni*n2*n3  =  nin2n3  -ni  •  <n2»n3>  -n2  •  <^i'n3>  -n3  •  <ni,n2> 
The  * -operation  is  obviously  commutative.  In  particular, 

p  *  p  *  . . .  *  p 


hj^Cn  j<n  »n> ') 


Cii) 


n  times 

(in  the  sequel  we  shall  write  this  p^*)  where 

,n 


bjj(x,a) 


n ! 


exp 


f  2l 

X  ! 

a”  exp' 

_2aj 

ax"  " 

2“J 

(12) 


n=0,l,2,...,  are  the  Eevmite  poZynovriats  (cf  [11;  p.37]).  Analogously  to  (9)  we  have 

*  H’  C13a) 

C13b) 


H  =  H,  *  H,  * 
nil 


*  . . .  *  H-"' 
nil  1 


,n® 


Let  ®Hj  ®  —  9  denote  the  symmetric  tensor-product  Hilbert  space  of 


by  itself  taken  n  times.  Then  for  arbitrary  5^,p^£H^,  i=l,2,..,,n,  with  <•,*> 


inner  product  in  we  have 


n® 


the 


<5^952®  •••  ®Cn»ni®n2‘ 


V„.-n- -niX?  .n2>...<5^  '  C14) 

n®  ireG  12  n 


where  5i®?2®  symmetric  tensor  product  [5,6].  Since  finite  linear  com¬ 

binations  of  such  tensor  products  are  dense  in  H^®,  it  is  now  easy  to  see  that  is 
isometrically  isomoiTJhic  to  and  hence  to  H^*.  For  n=2  we  can  illustrate  this 

by  factoring  the  symmetric  bilinear  map  (pj^,P2)  ■*'ni*n2  follows: 


«1  \«1 


■^«2=»1  *»1 


Hi  ®  H^ 

where  is  the  unique  linear  map  which  makes  the  diagram  commute;  <1)2  is  unitary.  Sim¬ 
ilar  unitary  maps  ({j^  are  defined  for  n=3,4,. 

If  A^,A2,...,  are  linear  operators  in  H^ ,  we  define  * _ *A^  :  via 

CA^*A2*...*A^)(p^*P2*...*n^)  =  (A^pp  *  CA2n2)  *  *  (A^h^) 

on  a  dense  set  in  H^  and  then  extend  it  continuously  to  all  of  H  .  We  define 
A^®'A2  9...  ®A^  :  -^H^  analogously.  For  n=2  we  have  then  the  following  picture: 


H, 


^  *  ^2 


H, 


H^  @  Hj^ 


A,  9  A, 


►H^  9  Hj^ 


and  analogously  for  n>2. 
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4.  STATE  SPACE  CONSTRUCTION 

THEOREM  1 .  The  subsvace  X  c  H  is  a  rrvCmmxl  state  space  for  y  if  and  only  if 

X  =  ®  X  C15a) 

neN 

where  is  a  minimal  Markovian  -splitting  svJospaae  and 

\  =  \  :=  Xi  *  Xi  *  ...  *  .X^  (15b) 

(n  times).  Then  each  X^  is  a  minimal  -splitting  subspaae. 

The  proof  of  this  theorem  is  based  on  the  following  lemmas. 

LEMMA  1.  Let  p  =p^  ®  ®  •  ■  ■  ®  ^  X  be  a  sidospaae 

of  Then 

_X9X9...®X  ,_X  ,  _X  ,  ._x  , 

E  n  =  (E  Pj)  9  (E  n2)  ®  •••  ®  (2  ^2)  • 


PROOF.  Let  f).  ;=E'p.,  and  let  C  =  Ci  ®  9  •  •  •  ®  5  where  •••  jC  are  arbitrary  ele- 

^  X  X  ^  Hi  X  T1 

ments  in  X.  Then,  by  (14) , 

<p^®P2® . .  .®p^-f!j^®P2® . .  .®f!^> 

n® 

<(ni-n]_)®ri2®- +<n2^®Cn2-n2)®-*-®nn»5>  +  . .. +<ni®fi2®. .  .®Cn^-n^) 

n®  n®  ^  n® 

ttcG  1  2  n  1  2  n 

...  ><02.5^  ’ 

In  n 

which  equals  zero  since  p^-p^iX.  □ 


LEMMA 


Let  Xj^  he  a  subspaae  of  and  let  X^  be  defined  by  (15b).  Then  X^  is  an 


-splitting  svhspaae  if  and  only  if  X^  is  an  {A^,A^ -splitting  subspaae. 


n® 


PROOF.  Due  to  isomorphism,  we  can  identify  H^,  and  X^  with  (Hj^)^®,  (H^)^®  and  X^ 
respectively.  But  Lemma  1,  (14),  and  the  definition  of  splitting  subspace  imply  that 
1  X  if  and  only  if  (H^)”®i  (H^)^®  1  X^®.  Hence  the  lemma  follows.  □ 

LEMMA  3.  Let  X^  and  X^  be  the  splitting  subspaaes  of  Lemma  2.  Then  x^^  is  minimal  if 
and  only  if  X^  is  minimaVL 

PROOF.  By  Proposition  1  in  [12]  it  suffices  to  show  that  the  condition  E'^^H~=X,  is 
equivalent  to  c  =  X^  (bar  over  E  stands  for  closure)  and  that  =  Xj  is  equivalent 

to  E^’^H^  =  Xj^.  By  isomorphism  E^^H^  =  Xj^  can  be  identified  with  E^^  (H^)^®  =  X^®,  which, 
by  Lemma  1,  holds  if  and  only  if  E^^H^  =  X.  A  similar  argument  establishes  the  other 
equivalence.  □ 

PROOF  OF  THEOREM  1.  Let  the  X  described  in  the  theorem  be  denoted  X,  and  set  X  :  =  a(Xj^). 
Then  X  is  the  space  of  all  centered  X-measurable  random  variables  in  H  [5,6],  and 
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aCX)  Moreover,  if  X  -^V^gU^X,  it  is  not  hard  to  see  that  X"  :=a(X")  =a(X“),  since 

U^X^ =  (U^Xj)^* .  An  analogous  relation  holds  for  summation  over  the  future. 

(1f):  Show  that  X  is  a  minimal  state  space.  Since  X^^  is  Markovian,  so  is  X.  Hence, 
in  view  of  Lemma  2,  X  is  a  state  space  for  y.  Now  assume  that  X  is  not  minimal.  Then 
there  is  a  state  space  X  such  that  X  :=a(X)  is  properly  contained  in  X.  Then,  since 
all  5  €  X  are  ^-measurable  and  X  is  the  space  of  all  centered  X-measurable  random  vari¬ 
ables,  XcX.  Therefore  X^  must  be  a  proper  subset  of  X^,  or  else  XsaCXj^)  =a(X^)  =X. 
This  contradicts  the  minimality  of  X^ . 

(only  if):  Let  X  be  a  minimal  state  space  for  y.  First  let  us  assume  that  X,  is  not 
minimal.  Then  there  is  another  (H^ ,Hj^) -splitting  subspace  X^  which  is  a  proper  sub¬ 
space  of  Xj^.  Let  X=aCX^),  and  let  X  be  the  space  of  all  ^-measurable  elements  in  H. 
Clearly  XcX  :  =  aCX).  We  want  to  show  that  this  inclusion  is  proper,  contradicting 
minimality  of  X.  But  this  is  the  case,  for  there  is  a  ? e  X^  such  that  5  i  X^ .  Conse¬ 
quently,  by  the  Gaussian  property,  a{5}  and  X  are  independent,  while  both  are  subfields 
of  X.  Hence  X,  must  be  minimal,  and  X,  =X,  .  Next  assume  that  X  is  not  of  the  form 

i  ^  ^  J.  i  ^  T1 

(15b)  j  i.e.  X^siX^.  Then  since  X^^  is  minimal  (Lemma  3),  X^+X^,  i.e.  there  is  a  5  e  X 
which  does  not  belong  to  X  and  consequently  is  not  X-measurable.  Hence  X  is  a  proper 
subfield  of  X  contradicting  minimality  of  X.  Therefore  X =  X.  Finally  X  is  Markovian 
only  if  is  Markovian.  The  last  statement  of  the  theorem  follows  from  Lemma  3.  □ 

5.  THE  STATE  SPACE  COMPONENT  OF  THE  FIRST  CHAOS 

Thus  it  remains  to  determine  the  minimal  Markovian  (H~,Hp -splitting  subspaces  X^ . 
This  is  almost  the  problem  solved  in  [1-3],  To  explain  how  it  differs,  let  ^ e H^ n H^ 
be  defined  in  the  following  manner.  If  yj^(O)  ^0,  set  ?:=yj^(0},  otherwise  let  it  be 
arbitrary.  (Remember  that  H“nHj?t0.)  Next  define  the  process  2(t)  :  =  Then 

z(t)  £ for  all  t.  Moreover, 

H'(z)  c  H'  (16a) 

H*Cz)  c  hJ"  (16b) 

where  H^(z)  and  Hj^(z)  are  the  closed  linear  hulls  of  the  random  variables  {z(t);tsO} 

and  {z(t);t>0}  respectively.  Since  y  is  purely  nondeterministic  and  mean-square  con¬ 
tinuous,  so  is  z.  Therefore  z  has  a  spectral  density  <l(ito),  A  scalar  solution  W  of 
the  equation 

W(s)W(-s)  =  <|(s)  (17) 

will  be  called  a  (full-rank)  speotral  faatoz'  of  z.  Now,  if  y  is  Gaussian  as  assmed 
in  [1,2],  z-y  and  we  have  equality  in  each  of  relations  (16).  Then  there  is  a  proce¬ 
dure  in  [1,2]  to  determine  X^^  from  a  certain  pair  (W,W)  of  spectral  factors.  However, 
in  the  non-Gaussian  case,  z*y,  and  we  cannot  assume  that  relations  (16)  hold  with 
equality,  not  even  when  z  =  y^^ .  Hence  there  is  a  "mismatch”  between  the  process  z  and 
the  geometry  in  H^^,  and  consequently  the  procedure  of  [1,2]  will  have  to  be  modified. 


Fortunately  the  basic  results  of  [1,2]  depend  in  no  crucial  way  on  the  spectral 
factor  construction.  The  following  result  found  in  [1,2]  is  a  consequence  of  the  geom¬ 
etry  in  only.  The  theorem  requires  some  new  notation;  For  any  Wiener  process 
{uCt)  ;t  £  H}  c  ,  let  H^C<iu3  and  H^Cdu)  be  the  Gaussian  spaces  generated  by  the  incre¬ 
ments  {u(t3-uCct)  ;T,a  <  0}  and  {u(t) -u(a)  ;t  ,cr  a  0}  respectively.  In  particular,  we  have 
Hj^(do)  =H^  and  H^(dv)  =Hj.  Here  and  in  the  sequel,  when  we  talk  of  a  "Wiener  process,” 
we  shall  always  mean  a  centered  Gaussian  process  defined  on  the  whole  veal  line  by  a 
spectral  representation 


u(t) 


fCO 

. 


e 


ixot 


-1 


lOJ 


du(i;j)  , 


CIS) 


-oo 


where  du  is  a  Gaussian  orthogonal  stochastic  measure  such  that  E  du| 


2Tr 


do), 


THEOREM  2.  A  stibspace  Xj^  =  is  a  minimal  Markovian  -splitting  svbspaae 

only  if 

X  =  H'Cdu)  e  H'Cdu) 

fop  some  pair  Cu,u)  of  WieTier  proaesses  in  sxiah  that 

H'(du)  c  H’(du) 

Hi  c  H~(du) 

H^  c  H^CdCi) 

Hj^Cdu)  =  hJ^  V  Hj^Cdu) 

H"Cdu)  =  H"  V  H'(du)  . 


if  arid 
(19) 

(20a) 

(20b) 

(20c) 

(20d) 

C20e) 


The  processes  u  and  u  (which  are  essentially  unique)  are  called  respectively  the 
forward  and  the  baakward  generating  proaesses  of  X.  (Condition  (20a)  is  equivalent  to 
H^(du)  and  H^(du)  intersecting  perpendicularly.  Moreover,  (20d)  is  an  observability 
and  (20e)  a  constructibility  condition  [1,2].) 

The  Gaussian  space  of  any  Wiener  process  u  in  Hj  coincides  with  H^  [9],  and  conse¬ 
quently  any  n  s  can  be  written 


n 


J  f(-t)du(t) 

-oo 


(21a) 


where  f€L20R),  or  equivalently, 

n 


f  (ia))du(ici)) 


C21b) 


/N 

where  oj-s-fCioj)  is  the  Fourier  transform  [9].  [We  shall  refer  also  to.  the  function  f 
as  the  Fourier-transform,  although  it  properly  should  be  called  the  (double-sided) 
Laplace-transform.]  Relations  (22)  establishes  an  isometric  isomorphism  between  H^  and 
L2CII),  where  S  is  the  imaginary  axis.  Let  T^  :  Hj^-^-L2(lI)  be  the  map  T^q  =  f .  Then  it 
can  be  seen  that  T^  is  unitary.  Let  T*  denote  the  adjoint,  i.e.  n  = T*f ,  which  is  rela¬ 
tion  (21b)  .  The  shift  U.^.  corresponds  to  e^^  under  the  isomorphism  T^. 
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LEMMA  4.  There  is  a  one-one  aorresvondenae  hetmeen  wiener  processes  u  in  and.  spec¬ 
tral  factors  'd  of  z  described  by  the  following  rule.  For  each  Uj  W:=T  c  is  a  spectral 
factor.  For  each  spectral  factor  Wj  u  defined  by  CIS]  and  du  =  W  ^Gdv^  where  G 
is  a  Wiener  process. 

PROOF.  LetW:=T^C.  Then 

zCt)  =  e^^Wdu  ,  (22) 

. 

-OO 

from  which  it  is  easy  to  see  that  the  inverse  Fourier  transform  of  E{z(t)z(0)'}  is 
WCiio) WC-iijj) ,  establishing  W  as  a  spectral  factor.  In  particular  G  is  a  spectral  factor. 
Therefore,  for  any  spectral  factor  W,  du  :=W  ^Gdv  is  a  Gaussian  orthogonal  stochastic 
measure  such  that  Eldul  =^da),  for  dv  is.  Hence  (18)  is  a  Wiener  process.  □ 

Next  we  introduce  the  Hardy  spaces  and  :  Let  ^2(^2)  be  the  subspace  of  L^OI) 
of  functions  whose  inverse  Fourier -transforms  vanish  on  the  negative  (positive)  real 
line.  From  (21)  it  follows  that  T  H~(du)  and  T^H'^(du)  =f(’.  A  function  K  which  is 
bounded  and  analytic  in  the  open  left  half-plane  and  has  modulus  one  on  the  imaginary 
axis  is  called  inner.  Define  K*(ia))  ;=K(-iu);  K*  is  the  inverse  of  K.  If  f  e  and 
K  is  inner,  fKsH*,  and  H*K  is  a  subspace  of  H*.  Let  H(K)  denote  the  orthogonal  com¬ 
plement  of  ^2^  in  ff2. 

THEOREM  3.  Let  5  e  n  he  arbitrary,  and  set  G  :=T^C  and  G  :=T-^.  Let  T  :=G/G. 

Then  is  a  minimal  Markovian  -splitting  svbspaae  if  and  only  if  there  is  a 

pair  of  inner  functions  (Q,Q*)  such  that  K  :=rQQ*  is  also  inner,  K  and  Q  are  coprime, 

K  and  Q*  are  coprime,  and 

=  T^f/CK)  ,  (23) 

where  u  is  the  Wiener  process  (18)  with  du  =  Q*dv. 

PROOF.  We  present  an  appropriately  modified  version  of  the  proof  in  [1,2],  The  idea 
is  to  translate  conditions  (20)  to  the  Hardy  space  setting  and  apply  Beurling's  Theorem 
[13].  To  this  end,  first  note  that  if  u,  and  U2  are  two  Wiener  processes  in  H^,  and 
and  W^  are  their  corresponding  spectral  factors, 

Tu^n  =  (T^^^n)  (W2/WP  (24) 

for  any  peH^,  as  is  easily  seen  from  Lemma  4.  Then,  if  W  and  W  are  the  spectral  fac¬ 
tors  corresponding  to  u  and  u  respectively,  applying  the  map  T^  to  (20a),  (20b)  and 
(20e)  and  T-  to  (20c)  and  (20d)  yields 


C  where 

K 

:=  W/W 

(25a) 

H2Q  c  ^2  where 

Q 

:=  W/G 

(25b) 

H'Q  =  H"  where 

Q 

:=  W/G 

(25c) 

H’  =  (H’Q)  V  (H‘K*) 

(25d) 

H2  =  (H^Q)  V  (K2X)  . 

(25e) 
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Now,  since  H^(du)  is  invariant  under  the  shift  {U^;t<  0},  h'^K  is  invariant  under 
{e^^;t<0}.  Therefore,  by  Beurling's  Theorem  [13],  (2Sa)  holds  if  and  only  if  K  is 
inner.  In  the  same  way  we  see  that- (25b)  is  equivalent  to  Q  being  inner  and  C25c)  to 
Q  being  inner  with  respect  to  i-®-  Q*  inner.  Moreover,  (25d)  and  (25e)  are  valid 
if  and  only  if  the  stated  coprimeness  conditions  hold  [13],  and  (19)  is  equivalent  to 
TuX  9  (H^K)  =:H(K),  i.e.  (23).  The  statement  about  u  follows  from  Lemma  4.  □ 

REMARK.  Let  us  pinpoint  in  what  way  this  theorem  differs  from  the  corresponding  result 
in  [1,2].  In  the  case  studied  in  [1,2],  the  pairs  (W,W)  which  generate  splitting  sub¬ 
spaces  are  precisely  those  for  which  and  K  is  inner.  In  the  present 

setting  these  three  conditions  must  also  hold,  but  in  addition  we  must  have  W = GQ  and 
W = GQ.  These  factorizations  correspond  to  the  inner-outer  factorizations  of  [1,2], 
but  the  difference  is  that  now  G  and  G  are  not  outer.  Consequently,  some  of  the  pairs 
(W,W)  mentioned  above  will  be  excluded.  Note  that  the  innovation  process  does  not  cor¬ 
respond  to  an  outer  spectral  factor  of  z,  since  is  not  the  predictor  space  of  z.  □ 


6.  THE  STATE  PROCESS 

We  recall  from  Section  2  that 


yCt)  =  I  y„(t) 


neN 


(26a) 


where 


rt 


yjt)  = 


"n-l 


g^(t-tj,t-t2,...,t-t^)dv(t^)du(t2) ...du(t^)  (26b) 


for  some  g^sL2(R^).  Let  us  assume  that  this  innovation  representation  is  given,  i.e. 
that  the  functions  {g^jneN}  are  known. 

Let  us  now  consider  a  minimal  state  space  X  with  forward  generating  process  u. 
Then,  since  c  (du) , 

y^(0J  €  H"(du)  :=  H’(du)  *  H’(du)  *  ...  *  H‘(du)  (27) 


(n  times)  and  consequently  there  is  a  representation 


y^CO) 


'n-l 


Wj^(-ti,-t2,. . .  ,-tj^)du(tj)du(t2)  .. .  .du(tj^) 


(28) 


-00  -CO  -03 


for  some  ).  Defining  to  be  zero  whenever  some  argument  is  zero,  we  may  write 

this  yjj(O)  =I^(w^;u).  By  the  same  recipe  we  write  yjj(O)  =  ljj(g^;v)  .  We  need  to  deter¬ 
mine  w  from  g  ,  To  this  end,  let  fcL-Cl^)  be  the  n-fold  Fourier- transform 
n  ®n  2  ^  ^ 


^(i“r--'’^n)  =  f---. 


-io)^  t,  -  .  .  .  -ioi  t 

e  ^  ’^f(tj,...,t^)dt^..  .dtj^ 


—CO  -CD 


Let  :  L2(ll^)  ->L2(n^)  be  the  operator  defined  by  f  =  F^f.  The  following  is  a  multidi¬ 
mensional  version  of  (21) . 


LB'IMA  5.  Let  f  €L2CR^)  szrui  set  t  :=  F^f.  Let  u  be  a  (/-tener  process  (18)  .  Then- 

I  (f;u)  =  I  (f;u)  . 


(29) 
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PROOF.  First  let  f  be  of  the  form  (8).  Then  f  has  this  form  too,  and  f^  =  Fj^f^.  From 
(21)  we  have 


f, C-t)du(t) 


f .  (ia)  du(i!jj)  . 


Then,  by  itQ's  formula,  i.e.  (11)  with  v  exchanged  for  u  or  u,  each  member  of  (29)  can 
be  reduced  to  the  same  expression  in  n2^»ri2’ ’  ’ ' holds  for  functions  of 
t>’pe  (8).  Then,  since  finite  linear  combinations  of  functions  of  type  (8)  are  dense 
in  L2(R^)  or  L^CH^),  (20)  holds  in  general.  □ 

Consequently,  defining  W  :=F„w  and  G  :=F  g_,  w  can  be  determined  from  g  via 

n  n  n  n  n®n  n  °n 

the  relation 

W^(ia^,...  ,iaj^)  =  G^(k)^,...,ia^)Q(mi^)...Q(ia^)  ,  (30) 

for  dO  =  Qdu  (Theorem  3) . 

It  is  well-known  [6],  and  we  have  already  used  this  fact  in  Section  2,  that 
n  ?=  I_(f;u)  defines  an  isomorphism  between  H  and  L-CII^).  More  precisely  T^^^  :H  -^L-(E^) 

^  t-n'\  ^  ^  liHfc 

defined  by  T^“'^n=^j  is  a  map  with  the  property  that  (n!)”^^^-^  is  unitary.  The  space 
of  Fourier -transforms  of  functions  (such  as  w^^)  in  which  vanish  whenever  an  ar¬ 

gument  is  negative,  can  be  identified  with  (ht)^®  so  that  T^^^H’(du)  =  (H!!)’^®.  In  the 

sequel  we  shall  use  precisely  this  realization  of  the  tensor-product  Hilbert  space 
,+,n® 


(Hn)  Then  the  tensor  product  f^  ®  f2  ® .  ®  f^^  is  given  by  (8).  Also,  for  subspaces 

«n’ 

(n), 


A^,A2,  . . .  ,A^  in 


so  that  in  particular 

=  H(K)  ®  H(K)  ®  ...  @  H(K) 


(31) 

(32) 


(n  times).  Then,  since  yjj(O)  «  W^£H(K) 


na 


Following  [1,2]  we  say  that  X  is  regular  if  H(K)  contains  only  Fourier-transforms 
of  continuous  functions.  All  X  with  dim  X^<«>  are  clearly  regular.  It  can  be  shown 
[1,2]  that  if  X  is  regular  the  functional 


Vf  = 


(27r) 


n 


3R' 


n 


f(icjj^, . . .  ,ia)j^)dtu^  . .  .du^ 


(33) 


n®  —I  n® 

is  bounded  on  H(K)  .  Hence,  since  Vf  =  f(0)  where  f  =  F^  f ,  there  is  a  B^sH(K) 
such  that 


f(0)  =  <f,B„> 

"  HCK)"® 

(Riesz  Theorem).  Next,  as  in  [1,2],. we  define  a  strongly  continuous  semigroup 
{e^^;t  2:  0}  on  H(K)  by 


(34) 


jAtj  ^  pH(K)g-iaJtJ 


(35) 


where  P' 


H(K) 


denotes  the  orthogonal  projection  on  the  subspace  H(K).  Moreover  define 


,  n® 


the  linear  bounded  operator  C.^  :  K(K)  ->-R  given  by 
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C  f  =  <W  ,f> 
n  n 


(36) 


Then  the  following  lemma  is  a  multilinear  version  of  the  construction  in  [14,15]  which 
is  being  used  in  [1,2]. 


LEMMA  6.  The  integrand  in  (28)  admits  the  factorization 

r  At.  At-  At  • 

.  C„[e  3e 

. 

PROOF.  In  view  of  (34) 


'^n*'^l’^2’  ’ '  ’  ~  ^ 


"  'Hy  ,B  )  . 
n’  n' 


Since  W^eH(K)^,  we  have 


w„(VV-.V=<V^™^  ' 

which  is  the  required  result.  □ 

Consequently 


na  -id),  tj^-iW2t2“' •  • 


W  \ 

"  "b  ) 

n'^ 


,  na 


where  x^(t)  is  the  H(K)  -valued  process 


X  (t)  = 


t. 

t  , 

1 

r 

1 

1 

J 

A(t-t.) 

e  ®  . . .  a  e 


■••CO  •••CO  -•CO 


(37) 


Bjjdu(ti)  . .  .du(t^)  . 


(38) 


(39) 


If  H(K)  is  infinite  dimensional,  {x  (t);t£R}  is  not  an  ordinary  stochastic  process 
but  must  be  defined  in  a  weak  sense  [16].  Then  the  state  -proaess  {x(t);tcJR}  is  de¬ 
fined  as  the  (possibly  weakly  defined)  H(K)^®-valued  process  with  components  x^; 

neN.  This  terminology  is  motivated  by  the  following  result  developed  along  the  lines 
in  [1,2]. 


PROPOSITION  1.  Let  X  be  a  regular  state  space  and  let  x^  be  given  by  (39).  Then 


Moreover,  for  eaah  n  e  Nj 


\  X  H^(du)  . 


(40) 

(41) 


PROOF.  Let  C  €  bs  arbitrary,  and  let  f  :=T^  •'5  and  f  :=  Then,  by  Lemma  5, 


C  = 


■0  r^l  r^n-1 


f(tj,...,t^)du(t^)...du(tj^)  . 


(42) 


-OO  -CO  -CO 


By  exchanging  w^  for  f  in  the  proof  of  Lemma  6,  we  obtain 


-At, 


-At 


a  ...  0  e 


"  HCR™ 


(43) 
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Then  (42)  and  (43)  together  yield  (40).  In  view  of  (19),  c H^(du)  i  H^(du) ,  from 
which  (41)  follows.  □ 

In  partular,  for  n=l  we  can  write  (38)  and  (39)  in  the  following  suggestive  form 

(dx^  =  Ax^dt  +  B.du 

(44) 

/l  =  CjXj 

The  higher-chaos  subsystems  are  nonlinear.  In  the  next  section  we  shall  illustrate 
this  with  an  example. 

Note  that  a  backward  realization  for  X  generated  by  u  is  obtained  by  developing 
the  above  analysis  in  (Wp^  rather  than  in  (Hp^.  Whereas  the  forward  property  is 
characterized  by  (41),  the  backward  one  is  determined  by  X^iH^du)  for  each  n  e  N. 

Finally,  in  the  case  that  X  is  not  regular,  other  constructions  involving  rigged 
Hilbert  spaces  are  possible  [19] . 


7.  THE  FINITE-DIMENSIONAL  BILINEAR  CASE 

To  illustrate  our  point  let  us  consider  the  simplest  possible  nonlinear  problem. 
Let  the  process  y  have  the  innovation  representation 


y(t)  = 


g^(t-a)dv(a)  + 


g.,  C  t -T ,  t -a)  dv  (a)  dv  (  t) 


(45a) 


and  the  backward  innovation  representation 


y(t)  = 


g^(t-cr)dv(a)  + 


g2(t-T,t-a)dv(a)dv(T)  . 


(4  5b) 


Assume  that  ^  rational  fimction  which  is  not  identically  zero. 


Then 


same  properties,  and  y^^  si  0.  Moreover  y^  has  a  rational  spectral 
density,  namely  <l>(s)  ;=  Gps)G^(-s) . 

Now,  setting  F  :=G^/G^,  find  all  pairs  (Q,Q*)  of  inner  functions  such  that 
K  :=rQQ*  is  inner  and  coprime  with  Q  and  Q*.  For  each  such  solution  form 


= 


H(K)Q*dv  . 


(46) 


Theorem  3  states  that  the  X,  -spaces  obtained  in  this  way  are  precisely  the  minimal 
Markovian  (H^,H^) -splitting  subspaces.  In  particular,  = 1  yields  X^  = E  and 

=  1  yields  Since  F  is  rational,  it  can  be  shown  that  K  must  be  rational. 


and  consequently  X^  is  finite -dimensional  [17] 


In  fact,  all  X^  have  the  same  dimen¬ 


sion  n  [1,2].  By  using  the  procedure  described  in  Section  7  of  [IS]  we  can  determine 
an  nxn-matrix  .A^  and  an  nxl -matrix  B^  from  K  and  a  Ixn-matrix  from  W  :*GQ  so  that 


dx^  =  A^x^dt 

Xj  -  CjXj  , 


B^du 


(47) 


where  sp{x^(t) , . . .  ,x^(t)}  =  U^X^,  Hpdu)  i  X^  and 


1 3 


U(t)  = 


10) 


Q*(iaj)dv  . 


(48) 


To  each  there  corresponds  a  minimal  state  space,  namely 


where 


X  =  ©  X^ 


X^  =  X^  *  Xj 


C49a) 


(49b) 


Hence,  for  each  t,  the  5jn(n+l)  random  variables  ;=  x^(t)  *  (t) ;  j  s  i}  span  U^X.2. 

(Remember  that  x^^  =x^^.)  Let  {x2(t);t£ll}  be  the  %n(n+l) -^dimensional  stationary  vector 
process  with  components  x^^ (t) .  Applying  Ito's  differentiation  rule  [6]  to 


we  obtain 


X2^(t)  =  x^(t)x^(t)  -  E{x^(t)x^(t)} 
dxj^(t)  =  I  [a^j^X2^(t)  -t-a  j^X2^(t)3dt  +  (b^x^(t)  +bjxj(t))du 


where  a^j^  and  b^  are  the  components  of  and  respectively.  Defining  the 
%n(n+l)xJsn(n+l) -matrix  A2  and  the  %n(n-t-l)xn-matrix  B2  appropriately,  this  can  be  written 


dX2  =  A2X2dt  +  B2X^du  . 

Integrating  this  bilinear  equation  we  get  an  expression  of  type 


.(50) 


XzCt)  = 


f (t-T,t-0)du(cr)du(T)  , 


where  f  is  a  vector -valued  function.  Moreover 


rt 


W2(t-T,t-a)du(0)du(T)  , 


where  W2  is  obtained  from  g2  via  formula  (30)  .  Now.,  since  y2C0)  6X2,  there  are  real 
numbers  {C|^jk=l,2, . . .  ,%n(n-*-l)}  such  that 

W2(t,0)  =  I  Cj^fj^(t,0) 
k 

and  these  numbers  can  be  determined  by  known  methods.  Let  C2  be  the  %n(n+l) -dimensional 
row  vector  with  components  c^,-  Then 


Since  7  =  '^72’ 


y2(t)  =  C2X2(t)  . 


dx^  =  A^Xj^dt  +  B^du 
dX2  =  A2X.,dt  +  B2X^du 
y  =  C^Xj  +  C2X2 


,(51) 


(52) 


is  a  realisation  of  y,  for  x  = 


is  a  Markov  process.  Note  that  even  if  y^^  v\)ere 


zero  we  would  need  to  include  x^  is  the  state  process  x,  for  X2  by  itself  is  not  Niarkov. 
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/«v  H  + 

Let  X  be  the  state  process  corresponding  to  X^=E  (in  the  coordinate -system 

of  C52)).  It  is  shown  in  [1-3]  that,  for  any  Xj_ ,  =x^C0).  Therefore,  in 

view  of  the  definition  of  X2  and  the  fact  that  E^^  = E^l  * E^- ,  E^2x2C0)  =X2C03.  (E^^ 

and  E^2  applied  to  a  vector  means  that  the  projection  is  performed  componentwise.) 

Consequently  the  conditional  expectation  of  xCt)  siven  /”  is 

^  t 

E  ^x(t)  =  x(t)  [53) 

for  any  realization  (52) .  For  this  reason,  remembering  that  the  for;vard  generating 
H*  + 

process  of  E  is  the  innovation  v,  we  may  call  the  system 

dx^  =  AjX^dt  +  B^dv 

dx_  =  A„x^dt  +  B.,x,  dv  (54) 

.  y  =  ^1^1  ^  ^2^2 

the  steady  state  non-ttneav  fitter  of  (52) ,  and  we  have  shown  that  this  filter  is  in¬ 
variant  over  the  class  (52)  of  minimal  realizations.  A  similar  result  can  be  obtained 
for  backward  realizations  in  terms  of  v. 


8.  CONCLUDING  RB1ARKS 

The  purpose  of  this  paper  is  to  investigate  the  structural  aspects  of  the  nonlinear 
stochastic  realization  problem  and  to  clarify  basis  concepts.  This  is  a  first  step 
toward  a  nonlinear  realization  theory.  Hence  we  have  not  concerned  ourselves  with 
algorithmic  aspects  of  the  problem,  and  our  analysis  is  based  on  the  availability  of 
an  innovation  representation,  the  actual  determination  of  which  is  a  nontrivial  problem 
in  itself  (see  [20]). 

The  question  of  state  space  construction  needs  to  be  further  studied.  It  cuuld 
be  argued  that  condition  (4)  is  too  restrictive  since  there  could  well  be 

-splitting  subspaces  which  are  not  of  the  form  (4),  having  a  nonzero 
angle  with  some  (or  even  all)  Hence,  if  we  can  do  without  realizations  of  indi¬ 

vidual  y^  but  only  need  their  sum  y,  it  is  possible  that  we  are  missing  state  spaces 
of  smaller  size. 

Our  interest  in  the  nonlinear  realization  problem  emanates  from  its  potential 
value  as  a  conceptual  framework  for  certain  classes  of  nonlinear  filtering  problems. 

This  will  be  the  topic  of  a  future  study. 
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